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Local asymptotic minimax risk bounds in 
a locally asymptotically mixture of normal 
experiments under asymmetric loss 
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Abstract: Local asymptotic minimax risk bounds in a locally asymptotically 
mixture of normal family of distributions have been investigated under asym- 
metric loss functions and the asymptotic distribution of the optimal estimator 
that attains the bound has been obtained. 

1. Introduction 

There are two broad issues in the asymptotic theory of inference: (i) the problem 
of finding the hmiting distributions of various statistics to be used for the pur- 
pose of estimation, tests of hypotheses, construction of confidence regions etc., and 
(ii) problems associated with questions such as: how good are the estimation and 
testing procedures based on the statistics under consideration and how to define 
'optimality', etc. Le Cam [12] observed that the satisfactory answers to the above 
questions involve the study of the asymptotic behavior of the likelihood ratios. Le 
Cam [12] introduced the concept of 'Limit Experiment', which states that if one is 
interested in studying asymptotic properties such as local asymptotic minimaxity 
and admissibility for a given sequence of experiments, it is enough to prove the 
result for the limit of the experiment. Then the corresponding limiting result for 
the sequence of experiments will follow. 

One of the many approaches which are used in asymptotic theory to judge the 
performance of an estimator is to measure the risk of estimation under an appro- 
priate loss function. The idea of comparing estimators by comparing the associated 
risks was considered by Wald [19, 20]. Later this idea has been discussed by Hajek 
[8], Ibragimov and Has'minskii [9] and others. The concept of studying asymptotic 
efficiency based on large deviations has been recommended by Basu [4] and Ba- 
hadur [1, 2]. In the above context it is an interesting problem to obtain a lower 
bound for the risk in a wide class of competing estimators and then find an estima- 
tor which attains the bound. Le Cam [11] obtained several basic results concerning 
asymptotic properties of risk functions for LAN family of distributions. Jeganathan 
[10], Basawa and Scott [5], and Le Cam and Yang [13] have extended the results 
of Le Cam for Locally Asymptotically Mixture of Normal (LAMN) experiments. 
Basu and Bhattacharya [3] further extended the result for Locally Asymptotically 
Quadratic (LAQ) family of distributions. A symmetric loss structure (for example. 
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squared error loss) has been used to derive the results in the above mentioned ref- 
erences. But there are situations where the loss can be different for equal amounts 
of over-estimation and undcr-cstimation, e. g., there exists a natural imbalance in 
the economic results of estimation errors of the same magnitude and of opposite 
signs. In such cases symmetric losses may not be appropriate. In this context Bhat- 
tacharya et al. [7], Levine and Bhattacharya [15], Rojo [16], Zellner [21] and Varian 
[18] may be referred to. In these works the authors have used an asymmetric loss, 
known as the LINEX loss function. Let A = ^ - 61, a 7^ and 6 > 0. The LINEX 
loss is then defined as: 

(1.1) /(A) = 6[exp(aA) -aA- 1]. 

Other types of asymmetric loss functions that can be found in the literature are as 
follows: 




CiA, for A > 
— C2A, for A < 0, Ci,C2 are constants, 



or 

. , _ J \w{d)L{A), for A > 0, (over-estimation) 
^ ' y w{9)L{A), for A < 0, (undcr-cstimation), 

where 'L' is typically a symmetric loss function, A is an additional loss (in percent- 
age) due to over-estimation, and w{9) is a weight function. 

The problem of finding the lower bound for the risk with asymmetric loss func- 
tions under the assumption of LAN was discussed by Lcpskii [14] and Takagi [17]. 
In the present work we consider an asymmetric loss function and obtain the local 
asymptotic minimax risk bounds in a LAMN family of distributions. 

The paper is organized as follows: Section 2 introduces the preliminaries and the 
relevant assumptions required to develop the main result. Section 3 is dedicated to 
the derivation of the main result. Section 4 contains the concluding remarks and 
directions for future research. 



2. Preliminaries 

Let Xi, . . . , Xn be n random variables defined on the probability space (X,A, Pe) 
and taking values in (5, S) , where S is the Borel subset of a Euclidean space and 
S is the (7-field of Borel subsets of S. Let the parameter space be O, where O is 
an open subset of . It is assumed that the joint probability law of any finite 
set of such random variables has some known functional form except for the un- 
known parameter 6 involved in the distribution. Let An be the cr-ficld generated 
by Xi, . . . , Xn and let Pe^n be the restriction of Pg to An- Let 6^) be the true value 
of 6 and let 9n = 0^ + Snh (h e R^), where (5„ ^ as n ^ 00. The sequence Sn 
may depend on 9 but is independent of the observations. It is further assumed that, 
for each n > 1, the probability measures Pea,n and Pe„,n are mutually absolutely 
continuous for all ^0 and 6n- Then the sequence of likelihood ratios is defined as 

LnO^n] ^0, ^n) = Ln{9o, 9n) ~ 



dPeo,n ' 



where X„ = (Xi, . . . ,X„) and the corresponding log-likelihood ratios are defined 
as 

K{0o,0n) = \0g Ln{9o,9n) = log ■ 



dP0o,r, 
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Throughout the paper the foUowmg notation is used: (l)y{iJ,,a'^) represents the nor- 
mal density with mean /i and variance cr^; the symbol '=J>' denotes convergence 
in distribution, and the symbol denotes convergence in Peo,n probability. 

Now let the sequence of statistical experiments En = {Xn, An, P9.n}n>i be a 
locally asymptotically mixture of normals (LAMN) at G ©■ For the definition of 
a LAMN experiment the reader is referred to Bhattacharya and Roussas [6] . Then 
there exist random variables Zn and Wn [Wn > a.s.) such that 

(2.1) A„(0o, ^n) = log - hZn + h'Wn ^ 0, 

and 

(2.2) (Z„, Wn) ^ {Z, W) under Pe„n, 

where Z = W^/'^G, G and W are independently distributed, W > Q a.s. and 
G ~ Af(0, 1). Moreover, the distribution of W does not depend on the parameter h 
(Le Cam and Yang [13]). 

The following examples illustrate the different quantities appearing in equations 
(|2.ip and (|2.2p and in the subsequent derivations. 



Example 2.1 (An explosive autoregressive process of first order). Let the random 
variables Xj,j = 1, 2, . . . satisfy a first order autoregressive model defined by 

(2.3) Xj=9Xj-i+€,,Xo^Q,\e\>l, 

where e^ 's are i.i.d. N{0, 1) random variables. We consider the explosive case where 
\0\ > 1. For this model we can write 

f,i0) = /(x,>i, . . . 0) a e^^(-^-''--^)'. 

Let ^0 be the true value of 9. It can be shown that for the model described in 
(12. 3p we can select the sequence of norming constants (5„ = '•^"^"'"^ so that (12. ip and 

"o 

(|2.2p hold. Clearly 5„ ^ as n — > oo. We can also obtain WniOo), Zn{Oo) and their 
asymptotic distributions, as n — > oo, as follows: 

Wn{9o) = ° 2„ asn~^oo, where W ^ xl and 

Oo 

n n 11 

Gn{0o) = {Y,xurHY.^j-^'o) = {J2xf_,)^9n -9)^g, 
j=i j=i j=i 

where G ~ iV(0, 1) and 9n is the m.l.e. of 9. Also 

ZniOo) = WS{9^)Gn{9n) = V (E-^J-i^j) ^ = ^' 

where 14^ is independent of G. It also holds that 

{Zn{9Q),Wn{9a))^{Z,W). 

Hence Z\W ~ A^(0, M^). In general Z is a mixture of normal distributions with W 
as the mixing variable. 
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Example 2.2 (A super-critical Galton- Watson branching process). Let {Xq = 
l,Xi,. . . , Xn} denote successive generation sizes in a super-critical Galton- Watson 
process with geometric offspring distribution given by 

(2.4) p{Xi = j) = e-\i - e-y-\ i = i,2,...,i<^<oo. 

Here E{Xi) = 9 and V{Xi) = a'^{9) = 0{9 - 1). For this model we can write 
m=f{xj\x^,...,xj.,;e) = (1- i)-.-.-(i)-.-. 

Let ^0 be the true value of 6. Here 6n can be chosen as "^If/V^^ • For this model 
the random variables Wn{Oo), Zn{Oo) and their asymptotic distributions are: 

= fcil => as n ^ oo, 

where W is an exponential random variable with unit mean. Here 

n n 

GniOo) = [0o{0o - l)]-^(5^X,_i)-^ Y^^Xj - eoXj_,) ^ G, 

where G ~ A'^(0, 1), and for W independent of G, 

Zn{0o) = wi(^o)G„(^o) ^ wiG. 

It also holds that 

{z„{eo),Wr,{eo))^iw'-G,w). 

The decision problem considered here is the risk in the estimation of a parameter 
9sR^ using an asymmetric loss function Throughout the rest of the manuscript 
the following assumptions apply: 

Al l{z)>OioT all z,z = 9 -9 

A2 l{z) is non increasing for z < 0, non-decreasing for z > and 1{Q) = 0. 
A3 / oo/o l{w~i z)e~i™^^ g{w)dwdz < oo for any c > 0, where g{w) is the 
p.d.f. of the random variable W. 

^ 1 o 1 12 

A4 / ^ Jq w2 z l{'w~2d — z)e~'^'^'^^ g{w)dwdz < oo, for any c,d> 0. 

Define la{y) = min(Z(j/),a), for < a < oo. This truncated loss makes l{y) 

bounded if it is not so. 
A5 For given W = w > 0, h{P,w) = j^^l{w^^ (3 — y)(j)y{{),w~^)dy attains its 

minimum at a unique (3 = [3{){w), and EI3o{W)) is finite. 
A6 For given W = w > 0, any large a,b > and any small A > the function 

h{P, w) = jfy^la{w-'^(3 - y)<j>yH), ((1 + X)w)-')dy 

attains its minimum at /9(w) = ${a, b, A, w), and E^{a, b, A, W) < oo. 

A7 lima^oo,6^oo,A^o P{a, b, A, w) = Po{w). 

A8 E{W-i) < oo. 

Note the following: 

1. Assumptions A3 and A4 are general assumptions made to ensure the finiteness 
of the expected loss and other functions. Assumptions A5 and A6 are satisfied, 
for example, by convex loss functions. 
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2. If is symmetric, then (3o{w) = = /3(a, b, A, w). 

3. If /(.) is unbounded, then the assumption A8 is replaced by A8' as E{W^^ x 
Z^'liW-iz)) < oo. 

Here we wiU consider a randomized estimator S,{Z, W) which can be written as 
£,{Z,W,U), where U is uniformly distributed on [0,1] and independent of Z and 
W. The introduction of randomized estimators is justified since the loss function 
Z(.) may not be convex. 



3. Main result 



Under the set of assumptions and notations stated in Section 2 we can have the fol- 
lowing generalization of Hajek's result for LAMN experiments under an asymmetric 
loss structure. 

Lemma 3.1. Let satisfy assumptions Al - A7 and let Z\W be a normal random 
variable with mean 6^/w-\-I3q{'w) and variance 1. Further let W be a random variable 
with p.d.f. g{w), then for any e > there is an a — a{e) > and a prior density 
tt{0) so that for any estimator S,{Z, W, U) satisfying 

(3.1) P,^,mZ,W,U)-W'iz\>e)>e 
the Bayes risk i?(7r,^) is 

(3.2) = J 7r{9)E{la{az,W,U)-9)\9)d9 

> / l{w^^ I3q{w) ~ y)4>y{Q^w~^)g{w)dydw + a. 



1 ^ 

Proof. Let the prior distribution of 9 be given by tt{9) — 'na{9) — (27r)^ 2 cr^^e^ 2^ , 
cr > 0, where the variance a^, which depends on e as defined in (j3.1[) . will be 
appropriately chosen later. As cr^ — > 00, the prior distribution becomes diffuse. 
The joint distribution of Z, W and 9 is given by 

(3.3) f[z\w)g{w)^{9) = (27r)- V-ie~5(^-(«-'+/3o(-)))'-iflg(y,), 

The posterior distribution of 9 given [W, Z) is given by 'ip{9\w, z), where ip{9\'w, z) 
is ^^( "'"lf(~y^\ and the marginal joint distribution of {Z, W) is given by 

(3.4) /(z, w) = 0z(/?o(w), a^r{w, (j))g{w), 

where the function r{s,t) = s + 1/t^. Note that the Bayes' estimator of 9 is 
r^w^'l^^"' ^^'^ when the prior distribution is sufficiently diffused, the Bayes' 
estimator becomes W~^{Z — f3o{W)). 

Now let e > be given and consider the following events: 

\W-^Z ^ (3n{W))\ < M, — = \{— -1)<W <m. 
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Then 



M _ M e 
- a^r{W, a) ~ '^^wTT ~ 2' 



Now, for any large a,b > 0, we have 



(3.6) 



5 

la{({z,w,u) - e)il;{e\z,w)de 



-b 

b 



W2 {z- Pojw)) 1 

— ^ )0y(O, 

r[w,a) r(w,a) 



where v ~ 6 — ""^(^ /3o(tu)) ^ Now, since 6*12, w jy>- a'^(z /^oCm)) _i — \ have 

y\z, w ^ N{0, ^). It can be seen that w, ^^)- Ift^^ " -u^-^oMI > f • 
Hence due to the nature of the loss function, for a given w > 0, we can have, from 
(EH), 



laiCiz, w, u) ^ y)0y(O, -. rjrfy 

-b r{w,a) r{w,a) 

/Vb ^ 2 
laiw-^Po{w) - y)0y(O, -)dy 
-Vb r[w,a) 

_i~ 1 

>/ laiw 2(3{a,b,X,w)-y)(f>y{0,— r)dy + S 

J-Vb r{'w,a) 



(3.7) 



h{f3{a,b,X,w)) + S, 



where S > depends only on e but not on a,b,a^ and (|3.7[) holds for sufficiently 
large a, b, (here A = ^ as ^ oo and — < w < m). 

A simple calculation yields 



h{l3{a, b, A, w)) 
rVb 



(3.8) 



la{w I3{a,b,\,w) - y)4}y{0,— -)dy 

-Vb r(w,cr) 

/■^ 1 ~ I y'^ 

> / la{w—2(3{a,b,X,w)-y)M0,-){l~^)dy 
J-Vb w 

/Vb 2 2 

la(w~'^~^(a, b, A, w) - y)^(j)y{0, -)dy. 
-Vb w 
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Hence 



/oo 
-oo 



' — oo 

> f TT{0)E{la{aZ,W,U)-0))d0 

(3.9) = / / / / la{^{z,w,u)-e)i;{e\z,w)f{z,w)d9dudwdz 

> J h{l3o{w))g{w)dw X P{\W-^^{Z - (3o{W))\ >b~Vh)-^ 

+ SPmZ,W,U)-W-i Z\> e, \W-^Z-(3o{W))\<M,—<W<m}, 

m 

using (|3.7p . p.Sp and assumption A4, where A: > does not depend on a, b, . Let 
A — {{z, w, u) g (— oo, oo) X (0, oo) X (0, 1) ■\S,{z, w,u) — z\ > e, 

\w~^{z - /3o(w))| < M, — < w < to}. 

TO 

Then P{A\e = 0) > | for sufficiently large M due to ([XT]) . Now under = the 
joint density of Z and W is (j)z{(3o{w), l)g{w). The overall joint density of Z and W 
is given in (|3.4p . The likelihood ratio of the two densities is given by 



fiz,w) 



f{z,w\e = o) 



-1 / ^(z-/3o(t»))'' " , 



and the ratio is bounded below on {(z, w) : \w 2 (z — (3o{'w))\ < M, < w < to} 
by (T^^r(TO, cr)^^ = — 17. Finally we have 

(mcr^ + l) 2 

(3.10) P{A)= [ [ f{z,w,u)dzdwdu> ^ ^ 

Ju=0 J A 



{ma^ + 1)2 2 

Hence for sufficiently large to and M, from (|3.9p . we have 

R{7T{d),0 > I h{Mw))9{w)dw[l - " ,J - 4 + ^ 



2/l(/3o(w))^ fT2 2(TOr72 + l)4' 

assuming F[|T4^-^(Z - MW))\ <h^Vb]>l- 2^^^. That is, 

R{A0),O> I h{P,{w))g{w)dw-^-\ + 5'- / 

Putting (5f (77icr2 + i)-i/2 _ ^ = ^ we find i?(7r(6'),$) > / h{l3o{w))g{w)dw + a. 
Hence the proof of the result is complete. □ 

Theorem 3.1. Suppose that the sequence of experiments satisfies LAMN 

conditions at 6 € Q and the loss function meets the assumptions A1-A8 stated 
in Section 2. Then for any sequence of estimators {Tn} of 6 based on Xi, . . . , Xn 
the lower bound of the risk of {T„} is given by 

limliminfsup Eg{l{S::\Tri ~ 6))} > 1{I3q{w) - y)(t)JQ,w-'^)g{w)dydw 
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Furthermore, if the lower bound is attained, then 



— Hw;:;^) 



or, as (T^ — > oo 



s;^\Tn -e)-Wn - Po{W)) ^ 0. 

Proof. Since the upper bound of values of a function over a set is at least its mean 
value on that set, we may write, for sufficiently large n, 

sup Ee{l{5-\Tn-e))}> [ 7r{h)EmS-\Tr,-t)-h)\0 = t + 5nh}dh, 

whatever the values of constants a, b and the prior density 7r(/i) may be. Now, let 
Z\w, 9 = t + Snh ^ N{9w'2 + f3o{w), 1). Then we can fix some S > and choose a, b 
and 7r(.) in such a way that 



/_ 



6 

Tr{h)E{la{^{Z, W, U) - h)\t + Snh}dh 

b 

> j l{f3o{w) - y)<l>y{0, w~^)g{w)dydw - 5, 



for any estimator ^(Z, W, U). 

Next we use Lemmas 3.3 and 3.4 of Takagi [17], where we set 

Sn = 5-\Tn - t), A„,t = WnHZn " PoiW)), and 
5„(A„,( =x,U = u)= mi{y : P(5„ < y\An,t = x) > u}. 

Let Fn^h = distribution of Sn under Pn^tn Fnh ^ distribution of S'„(A„ j,t/) = 
S,n{Zn, W, U) under Pn,h, where U ~ Uniform (0, 1) and is independent of A„^t; Gn,h 
is the distribution of A„_t and G* ^ is the distribution of A^ = Wi{Z — Po{W)). 
As a consequence of this we have (Takagi [17], p. 44) 

lim I \F,,,h - Fl^W = and lim | - G^^jj = 0. 

Now for any estimator ^n{Zn, Wn, U) = 5,i(A„^i, U) and for every heR^ we have 

\E[la{5-\Tn -t)- h)\t + 5nh] - E[la{Sn{An,t, U) - h)\t + 5nh]\ 

and 

\E[l^{Sn{An,t, U) - h)\t + 5nh] - E[la{UZ, W, U) - h)\t + 0. 

Finally 

j Tr{h)E{l{S;;\Tn -t)- h)\e = t + 5nh}dh 

> I ^{h)E{la{in{Z,W,U)-h)\t + 5nh}dh 

J-b 

> j l{Po{w) - y)(j)y{0,w~^)g{w)dydw, for n > n(a, 6, tt) 

which proves the result. □ 
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Example 3.1. Consider the LINEX loss function as defined by It can be seen 
that ?(A) satisfies all the assumptions A1-A7 stated in Section 2. Here a simple 
calculation will yield 

and h{(],w) attains its minimum at (3o{w) — — ^ ^f^^ and h{l3o,w) = 
4. Concluding remarks 

From the results discussed in Le Cam and Yang [13] and Jeganathan [10] it is clear 

that under symmetric loss structure the results derived in Theorem 3.1 hold with 

_ 1 _ 1 

respect to the estimator Wn ^(^o)^n(^o) and its asymptotic counterpart W'^Z. 

Here due to the presence of asymmetry in the loss structure the results derived in 

Theorem 3.1 hold with respect to the estimator Wn^ {0o){Zn{Oo)-l3o{W)) + f3Q{W) 

and W-i{Z - po{W)) + (3o{W). 

Now Wnh0o){Z.n{eo) - Po{W)) ^ W'^iZ - (3o{W)). Hence the asymptotic 
bias of the estimator under asymmetric loss would be E{W~^ {6o){Z — /3o{W)) + 
PoiW) - 0) - Eie + l3„iW) -e)= E{l3o{W)). 

Consider the model described in Example 2.1. Under the LINEX loss we have 
Po{w) = —^-;;j/2 (vide Example 3.1). Here the asymptotic bias of the estimator 
would be E{(3o{W)) = —^E{W~i), which is finite due to Assumption A8. 

The results obtained in this paper can be extended in the following two direc- 
tions: (1) To investigate the case when the experiment is Locally Asymptotically 
Quadratic (LAQ), and (2) To find the asymptotic minimax lower bound for a 
sequential estimation scheme under the conditions of LAN, LAMN and LAQ con- 
sidering asymmetric loss function. 
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